The Journal of the Acoustical Society of America — Latest Matching Preprints

1

Auditory perceptual expertise: Amplitude modulation rate discrimination near the threshold for detection

Garcia Ruiz, T.; Sanes, D. H.

2026-05-11 animal behavior and cognition 10.64898/2026.05.06.723339 medRxiv

Top 0.1%

8.0%

Show abstract

Many perceptual skills improve with a few days of training. However, weeks or months of practice may be required to reach a level of expertise on complex tasks (Watson, 1980). Here, we explored how gerbils attain expertise on a difficult task: amplitude modulation (AM) rate discrimination at very shallow AM depths, similar to the depths used during vocal communication. Using an appetitive Go-Nogo procedure, we first trained 6 gerbils to perform an AM discrimination task (Nogo: 4 Hz; Go: 4.25-10 Hz) at a depth of 0 dB (re: 100% depth). Animals were then trained to perform AM discrimination at successively shallower depths, from -3 to -18 dB, requiring an average of 5-10 days of practice to reach a performance metric of d[≥]1 for each depth. Finally, we determined that AM discrimination thresholds were nearly identical between 0 to -12 dB, and only slightly elevated at -15 dB. Improvements in performance were accompanied by a large reduction in response time during procedural learning, and a gradual reduction of response time during perceptual learning, even as AM depth became shallower (i.e., more difficult). The shallowest depth at which gerbils displayed peak performance on the AM discrimination task is similar to their lowest AM depth detection thresholds. These results suggest performance on challenging auditory perceptual tasks require prolonged practice, and is accompanied by increased automaticity (i.e., lower response time) that stabilizes once expertise is achieved.

2

Differentiating the Physiological Signatures of Cochlear Synaptopathy and Inner Hair Cell Damage in a Chinchilla Model

Sivaprakasam, A.; Schweinzger, I.; Heinz, M.

2026-05-08 neuroscience 10.64898/2026.05.05.723072 medRxiv

Top 0.1%

6.6%

Show abstract

Aging and noise over-exposure lead to complex mixtures of cochlear degradation that impair the structure and function of outer hair cells, inner hair cells (IHCs), and the cochlear nerve. However, IHC damage and cochlear synaptopathy (CS) remain pathologies "hidden" from the audiogram. This study aimed to identify and differentiate the physiological signatures of these two distinct pathologies using promising non-invasive assays: Envelope Following Responses (EFRs), Auditory Brainstem Response (ABRs), Wideband middle-ear reflexes (WB-MEMRs), and Distortion Product Otoacoustic Emissions (DPOAEs). We utilized chinchilla models of carboplatin-induced (CA) IHC damage (N = 4) and temporary threshold shift (TTS) noise-induced CS (N = 4) to compare the physiological signatures of each pathology. While both groups showed unchanged ABR thresholds two weeks after exposure, EFRs, ABR Wave V/I ratios, and MEMRs showed distinct effects of exposure. Despite non-elevated ABR-derived audiometric thresholds after exposure, both CA and TTS exposure resulted in severe in EFR "peakiness", particularly for sharp, short-duty-cycle stimuli and significant elevations in ABR Wave V/I ratios. However, these findings were less-pronounced in the TTS-exposed animals. WB-MEMR amplitudes were decreased with elevated thresholds in both groups; this effect was more pronounced in the TTS group. Opposite trends in DPOAE amplitudes indicated that while both IHC damage and CS result in similar suprathreshold temporal coding deficits, effects on outer-hair-cell integrity and auditory efferent physiology may differ between the two pathologies. Future work and novel diagnostics should aim to distinguish these specific cochlear pathologies in clinical populations, or at the very least consider their overlap. HighlightsO_LIA multi-metric diagnostic approach was used with chinchilla models of inner-hair-cell (IHC) damage and cochlear synaptopathy (CS). C_LIO_LIIHC damage and synaptopathy both cause suprathreshold deficits "hidden" from the audiogram. C_LIO_LIIHC damage results in more severe temporal envelope coding degradation than does synaptopathy. C_LIO_LIA combination of EFR "peakiness", ABR Wave V/I ratio, and Wideband Middle Ear Muscle Reflex (WB-MEMR) appear to be useful measures for profiling IHC damage and CS. C_LI

3

Quantifying the vocal repertoire of adult common terns (Sterna hirundo )

Zogby, D. S.; Eddington, V. M.; Craig, E. C.; Kloepper, L. N.

2026-05-22 animal behavior and cognition 10.64898/2026.05.20.722623 medRxiv

Top 0.1%

4.8%

Show abstract

Common terns (Sterna hirundo) are regionally threatened migratory seabirds that form large breeding colonies during the North American summer months. They are highly vocal and serve as important bioindicators of aquatic ecosystems. Historically, acoustic studies on colonial seabirds have proven difficult due to the dense aggregations of individuals and high rate of call overlap. However, as passive acoustic monitoring (PAM) becomes increasingly common for studying seabird colonies, quantitative descriptions of species vocalizations are needed to accurately interpret behavioral information from colony soundscapes and support automated analysis of large acoustic datasets. This study aims to quantify the vocal repertoire of adult common terns. We deployed AudioMoths to collect acoustic data at a tern colony on Seavey Island, New Hampshire, USA from across the breeding season. Using RavenPro, unique call types were identified through visual and aural inspection of the acoustic data in the spectrogram. For each call, we then extracted measurements of peak frequency (Hz), bandwidth 90% (Hz), syllable duration 90% (s), and total bout duration (s) to quantify the characteristics of each call type. Statistical analyses for acoustic parameters by call type were performed using Kruskal-Wallis tests, followed by post-hoc Dunn tests. Our results demonstrate that each call type is significantly different from another by at least one parameter, with the exception of the kek and kip/tjuk calls. These findings present the first quantitative analysis of common tern vocalizations for North America. By defining temporal and spectral characteristics for multiple call types, this work helps translate colony soundscape into biologically meaningful information about tern behavior and colony dynamics. These descriptions also provide key parameters for developing automated tools to detect and classify vocalizations in dense, noisy colonies. Integrating quantified vocal characteristics with PAM offers a promising approach for monitoring colony activity and behavior while minimizing disturbance relative to traditional methods.

4

Auditory Working Memory Mediates the Relationship between Musical Sophistication and Speech-in-noise Perception

Colak, H.; Benzaquen, E.; Guo, X.; Lad, M.; Sedley, W.; Griffiths, T. D.

2026-05-13 neuroscience 10.64898/2026.05.13.724783 medRxiv

Top 0.1%

3.5%

Show abstract

Understanding speech in noisy environments (SPIN) is an important everyday ability, and engaging in musical activities has been proposed as a factor that may support this ability. However, the cognitive mechanisms underlying a potential musical advantage in SPIN perception remain unclear. Here we investigated whether musical sophistication is associated with better SPIN perception in a large population-based sample, and whether this relationship is mediated by auditory working memory (AWM), verbal working memory (VWM), or non-verbal intelligence. We recruited 203 participants and measured SPIN perception at both word and sentence levels. Musical sophistication was assessed using the Goldsmiths Musical Sophistication Index (Gold-MSI). AWM was measured using delayed matching of tone frequency or the modulation rate of amplitude modulated white noise, VWM was based on backward digit span task, and non-verbal intelligence used matrix reasoning. Mediation analyses revealed that AWM fully mediated the relationship between musical sophistication and SPIN perception, whereas VWM showed no mediation effect. Non-verbal intelligence showed a partial mediating effect. Additional control analyses using structural equation modelling revealed that the indirect effect through AWM remained significant after accounting for age, hearing thresholds, and non-verbal intelligence. Together, these findings suggest that individuals with greater musical sophistication demonstrate better daily life listening abilities, and that superior auditory working memory may be the key cognitive mechanism underlying this advantage.

5

Sound Localization Is Biased By Simultaneous And Delayed By Preceding Visual Distractors

Rocchi, F.; Haukes, N. C.; van Opstal, A. J.; van Wanrooij, M. M.

2026-05-15 neuroscience 10.64898/2026.05.12.724474 medRxiv

Top 0.1%

2.0%

Show abstract

AO_SCPLOWBSTRACTC_SCPLOWVision can shape auditory perception, especially when visual cues occur at different times and locations than sounds. Simultaneous but spatially misaligned lights bias the perceived location of a sound--a phenomenon known as the ventriloquism effect. Temporally misaligned lights can also affect the latency of auditory responses. However, it remains unclear how multiple visual stimuli that differ from sounds in both space and time jointly influence localization behaviour. We investigated how visual distractors, spatially misaligned by 10{degrees}, presented before and/or during a target sound influence localization accuracy and response latency in a rapid head-pointing task. Human listeners localized brief (150 ms) broadband noise bursts with an average root-mean-square error of 5{degrees} and a baseline latency of 252 ms. Simultaneous visual cues induced the ventriloquism effect, in which the perceived sound location was biased by 1.8{degrees}. Response latency also increased by 21 ms (273 ms). Preceding visual stimuli (2 s duration) did not induce a bias, but increased latency by 55 ms (307 ms). Introducing a 200 ms gap between the preceding light and the sound reduced this latency increase to 24 ms (276 ms), still not inducing a significant bias. When we presented both a preceding and a simultaneous light on opposite sides of the sound, localization reflected the bias induced by the simultaneous light (1.8{degrees}) and the latency increase induced by the preceding light (by 48 ms). These findings reveal a dissociation in audiovisual integration: preceding visual stimuli primarily influence when a sound is responded to (latency), while simultaneous stimuli influence where it is perceived (accuracy). This supports causal inference models of multisensory integration and suggests distinct underlying mechanisms for spatial and temporal processing of sounds in sensorimotor circuits.

6

Beyond Onset Timing: Longer Sound Envelope Duration Enhances Neural Representation of the Musical Beat

Rosenzweig, F.; Lenoir, C.; Lenc, T.; Polak, R.; Huart, C.; Nozaradan, S.

2026-05-13 neuroscience 10.64898/2026.05.12.721298 medRxiv

Top 0.1%

1.7%

Show abstract

Musical rhythm is often experienced with a periodic beat, serving as a temporal reference for coordination with the rhythm. Thus far, models of beat processing have mainly relied on representing sensory inputs as patterns of onset timing, with limited consideration of other sensory features. Here, we challenge this view by showing that the internal representation of beat is affected by other temporal features of the stimulus beyond onset timing alone. We recorded electroencephalography (EEG) while participants listened to rhythmic sequences designed to elicit a beat. Across conditions, we manipulated the duration of the tones conveying the rhythms, while keeping all other parameters identical, including overall intensity, speed, and rhythmic pattern structure. Crucially, the beat periodicity was enhanced in neural activity with increased sound duration, even though the beat periodicity was not prominent in the acoustic features, thus ruling out basic sensory confounds. These results demonstrate the preferential role of longer sound durations in fostering temporal scaffolding processes that integrate fast rhythmic inputs into behavior-relevant internal structures such as the beat. More generally, our findings are compatible with a holistic processing account whereby a range of features beyond onset timing may be integrated into a neural representation of rhythm. Graphical Abstract: Fig. 2EEG was recorded while listeners heard rhythmic sequences eliciting a beat. Sound duration (sonic duty cycle) was varied across four conditions while speed, pattern, and intensity stayed constant. Beat-related EEG responses increased with longer sounds, and were enhanced in all conditions compared to auditory nerve model envelopes, which did not show prominent energy at the beat periodicity, ruling out sensory confounds. Results support holistic rhythm processing beyond onset timing alone. O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=101 SRC="FIGDIR/small/721298v1_fig2.gif" ALT="Figure 2"> View larger version (27K): org.highwire.dtl.DTLVardef@10a0599org.highwire.dtl.DTLVardef@f5a95forg.highwire.dtl.DTLVardef@42d1ceorg.highwire.dtl.DTLVardef@dc58a7_HPS_FORMAT_FIGEXP M_FIG O_FLOATNOFigure 2.C_FLOATNO EEG and auditory nerve model output analysis based on magnitude spectrum and autocorrelation. Each row represents a duty cycle condition. The two columns on the left represent the magnitude spectrum-based analysis. The first column represents the group-level averaged magnitude spectra at a pool of fronto-central electrodes, across conditions. Beat-related frequencies are shown in red, and beat-unrelated frequencies are shown in blue. Scalp topographies of the neural activity measured at the average magnitudes of beat-related (in red circle) and unrelated (in blue circle) frequencies are represented as insets. The second column represents the normalized magnitude spectra obtained from the auditory nerve model output for each duty cycle sequence. The two columns on the right represent the autocorrelation-based analysis (for visualization purposes, only a subset of lags from 0 to 2.4 s corresponding to the pattern duration is shown). The first column represents the group-level averaged autocorrelation function measured from the same pool of fronto-central electrodes, across conditions. Beat-related lags are shown in red, and beat-unrelated lags are shown in blue. The second column represents the autocorrelation function of the auditory nerve model output for each duty cycle sequence. C_FIG

7

An Open Reproducible Framework for CNN-Based Cetacean Vocalization Detection in Passive Acoustic Monitoring

De Marco, R.

2026-05-06 animal behavior and cognition 10.64898/2026.05.01.721665 medRxiv

Top 0.1%

1.5%

Show abstract

This paper presents a six-stage methodological framework for Convolutional Neural Net-work (CNN)-based cetacean vocalization detection and classification in Passive Acoustic Monitoring (PAM), implemented as the open-source toolkit ai-pam-pipeline. The frame-work is generalizable across species and fully parameterised through a single configuration file, guaranteeing exact experimental reproducibility. Two experiments are reported. Experiment A examines the effect of FFT window length Nfft [isin] {256, 512, 1024} on binary Bottlenose dolphin (Tursiops truncatus) whistle detection using stratified 10-fold cross-validation on an in-domain dataset (Oltremare, 192 kHz) and a cross-domain benchmark (DCLDE 2022). In-domain performance is uniformly high (macro F1{approx} 0.98; Wilcoxon, all p > 0.05). Cross-domain results diverge substantially: Nfft = 256 is significantly superior (p = 0.006, rank-biserial r = 0.89). The mechanism is an upsampling amplification effect: coarser spectral bins produce wider, higher-contrast FM traces after bilinear resampling to fixed image dimensions. This superiority is threshold-invariant: precision equals 1.000 across all configurations and thresholds{theta} [isin] [0.1, 0.9], confirming that the advantage is not an artifact of threshold choice. These findings demonstrate that preprocessing choices -- often treated as secondary implementation details -- can significantly affect cross-domain generalisation. While Nfft serves here as a controlled case study, the framework is designed to enable systematic, reproducible evaluation of arbitrary preprocessing parameters within a unified experimental protocol. Experiment B demonstrates multiclass capability on five T. truncatus vocalization cate-gories (macro F1 = 0.843); inter-class confusion between click trains and burst-pulse sounds reflects biological signal overlap rather than classifier failure.

8

Individual variation in sound localization accuracy is correlated with the properties of eye movement-related eardrum oscillations (EMREOs)

Herche, J. L.; King, C. D.; Groh, J. M.

2026-05-14 neuroscience 10.64898/2026.05.13.724941 medRxiv

Top 0.1%

1.5%

Show abstract

Calibration of sound localization behavior in species with mobile eyes requires not only accurate visual input but also accurate oculomotor signals across the lifespan. The recent discovery of eye movement-related eardrum oscillations suggest that oculomotor signals may be incorporated into auditory processing at the level of the ear. One inference of this discovery is that individual variation in such signals might be correlated with individual variation in sound localization accuracy. Here, we tested this hypothesis in humans with normal hearing. We discovered that there is considerable variation in the accuracy of sound localization (here, saccades to sounds) even in normal individuals: median horizontal errors ranged from 2-6{degrees}, and median vertical errors could be as large as 36{degrees}. We separated the subject pool into groups with "good" performance (median vectorial error < 8{degrees}) vs "poor" performance (median vectorial error > 10{degrees}) and evaluated their respective EMREOs. The EMREOs differed across the two groups in both horizontal and vertical dimensions, in how saccade amplitude vs. initial eye position was encoded, and across time with respect to the saccade. These results are consistent with the interpretation that EMREOs are associated with underlying processes that ensure the accuracy of sound localization. HIGHLIGHTSO_LIThe accuracy of eye movements to look at sounds varied across individuals, with median errors spanning a greater than 10-fold range. This range is surprising given that the participants passed screening for normal hearing. C_LIO_LI"Good" vs "poor" sound localizers exhibited differences in their eye movement-related eardrum oscillations (EMREOs) C_LIO_LIEMREOs differed in both horizontal and vertical sensitivity, for both saccade amplitude and initial eye position, and the differences varied in timing with respect to saccade onset. C_LIO_LIWe interpret the results under the theory that poor sound localization may be a consequence of poor eye movement encoding, without which linking visual and auditory space is likely inaccurate. C_LI

9

Peripheral anatomy of the dolphin ear and associated nervous structures: insights from macroscopic dissection, DICE-{micro}CT, histology, and confocal microscopy

De Vreese, S.; Graïc, J.-M.; Mazzariol, S.; Huggenberger, S.; Fogli, M.; Luzzati, F.; Corona, C.; Favole, A.; Cerda-Domenech, M.; Frigola, J.; Andre, M.

2026-05-19 neuroscience 10.64898/2026.05.15.725593 medRxiv

Top 0.2%

0.9%

Show abstract

The peripheral auditory system of dolphins comprises specialised bony, fatty, vascular, and neural structures adapted for underwater hearing and diving physiology. These include the external ear canal, acoustic fat bodies, sinuses, and associated neurovascular networks, which together support sound conduction, protection, and possibly sensory functions. Despite advances in gross anatomical description, the detailed integration of these tissues, particularly the innervation, neurovascular organisation, and their functional implications, remains poorly understood. Previous studies have described the presence of sensory nerve formations and vascular plexuses, but their arrangement, connectivity, and relation to each other are unresolved. Here, we combine macroscopic dissection, DICE-{micro}CT, histology, and high-resolution confocal microscopy to characterise several neurovascular and sensory components of the dolphin peripheral auditory system in several delphinid species. Macroscopic dissection and DICE-{micro}CT revealed the traditional acoustic fat body distribution with detailed morphology of the posterolateral extension that is not well-known. The cranial nerve distribution, and specifically the mandibular nerve branching patterns, are described in detail. Confocal microscopy uncovered a stratified neurovascular plexus around the external ear canal with a complex sensory system comprising lamellar corpuscles, Merkel cell-neurite complexes, and intraepithelial nerve fibres. Notably, the lamellar corpuscles formed a continuous, three-dimensional neural network with frequent merging and splitting of axonal bundles, shared perineuria, and vascular integration, features not observed in previous studies. Our findings demonstrate that the dolphin external ear canal and surrounding structures form a sophisticated, multimodal somatosensory organ, integrating structural, vascular, and neural specialisations likely adapted for proprioceptive mechanosensation in the aquatic environment. This study provides insights into the integration of the various components of the peripheral hearing apparatus. Future studies integrating anatomical, electrophysiological, and biomechanical approaches are needed to fully elucidate these adaptations.

10

A comparison of BirdNET, expert listening and acoustic indices to monitor avian diversity in a Mediterranean agricultural landscape

Akoglu, I.; Bacak, E.; Bilgin, S.; Boyla, K. A.; Duran, M.; Akcay, C.; Ertor-Akyazi, P.

2026-05-21 ecology 10.64898/2026.05.20.726349 medRxiv

Top 0.2%

0.8%

Show abstract

Passive acoustic monitoring poses an immense potential to assess avian diversity in many habitats, including agricultural landscapes. At the same time, automated recorders generate large datasets which present a challenge for processing and effectively assessing biodiversity. Methods such as manual listening by experts, automated detection algorithms like BirdNET and calculating acoustic indices all present different trade-offs in assessment of biodiversity through passive acoustic monitoring. In the present study we recorded soundscapes in a low-intensity agricultural landscape in western Turkiye in all four seasons. Two expert ornithologists listened to a subset of these recordings identifying bird species from the recordings. We also ran the same sample of recordings on BirdNET to compare BirdNET detections with expert detections and calculated acoustic indices for each recording. The results showed that BirdNET detected more species than experts, although some may not be reliable detections. Two acoustic indices (bioacoustic index and acoustic complexity index) were correlated positively with number of species detected by experts and one (normalized difference soundscape index) with number of species detected by BirdNET but the correlations were modest. The results show that acoustic indices may have limited value in detecting biodiversity and automated detection algorithms may do a better job, although these may need to be trained with local data to improve detection and classification.

11

Tuning into the city soundscape: Optimizing Convolutional Neural Networks for avian acoustic identification in the neotropics and evaluating their performance against established monitoring approaches.

Ardila-Villamizar, M.; De Clippele, L. H.; Dominoni, D. M.

2026-05-13 ecology 10.64898/2026.05.10.724049 medRxiv

Top 0.2%

0.7%

Show abstract

Convolutional Neural Networks (CNNs) have become increasingly prominent in biodiversity monitoring due to their strong performance in accurately detecting species from sound recordings, overcoming some limitations of traditional methods such as point-counts. Yet, their use in urban ecosystems remains limited, highlighting the need for frameworks that identify modelling strategies to optimize their performance in these complex soundscapes. Here, we evaluated how preprocessing and labelling strategies, detection thresholds, sample size, and architecture affect the performance of CNNs for bird identification in urban tropical ecosystems. We also assessed its potential by comparing CNN-derived biodiversity estimates with those from point-counts and acoustic indices. For this, we used one week of recordings collected along urbanization gradients in five Colombian Andes cities to developed 11 multiclass CNN models varying in spectral representation, labelling strategies, training data source and backbone architecture. The best-performing model, evaluated with F1-scores, combined Log-Mel spectrograms, multispecies labels, ecosystem-specific recordings, a probability threshold of 0.3 and a ConvNeXt backbone with its performance generally improving with sample size. Although CNNs and point counts detected partially distinct assemblages, CNN-derived species richness was comparable to that estimated from point-counts. In addition, the Normalized Difference Soundscape Index (NDSI) was positively associated with richness, suggesting its potential as a biodiversity proxy in tropical urban soundscapes. Overall, by identifying effective modelling designs and monitoring strategies, our study advances the development of robust biodiversity assessment frameworks in urbanized ecosystems in the Neotropics whilst also providing methodological guidance for future research and practical insights for wildlife monitoring and conservation.

12

How do we align in good conversation? Investigating the link between interaction quality and multimodal interpersonal coordination

Dominguez-Arriola, M. E.; Lam, P. C. H.; Perez, A.; Pell, M. D.

2026-05-11 neuroscience 10.64898/2026.05.09.723997 medRxiv

Top 0.3%

0.4%

Show abstract

Conversations can feel effortlessly engaging or, conversely, difficult and unrewarding. Multiple factors contribute to the experienced quality and outcomes of a conversation, among them how interlocutors align with each other. The present study investigated speech-to-speech, brain-to-speech, and brain-to-brain coordination as markers of interpersonal alignment, examining their relationship with jointly perceived interaction quality and mutual affinity between conversational partners. Pairs of previously unacquainted participants (dyads) engaged in multiple short, free-form conversations on topics of varying interest while their vocal and neural activity were simultaneously recorded in a dual-EEG ("hyperscanning") setup. We analyzed interlocutors prosodic adaptation, neural speech tracking, and neural coordination during each conversation. At the speech-to-speech level, our findings reveal that partners with more positive mutual impressions became more similar in their volume and voice quality over the course of the experiment session, reflecting greater prosodic convergence. At the brain-to-speech level, we found no reliable effect of interaction quality on neural tracking of unfolding speech within any individual region, although topographical differences suggested relative modulation across scalp sites. Finally, at the brain-to-brain level, our findings show that higher perceived interaction quality enhanced inter-brain relationships across frequency bands (alpha and theta) and temporal dependencies (concurrent/near-instantaneous and recurrent/listener-lagging), with the strongest effects observed for concurrent alpha-band coupling. These findings suggest that distinct coordination processes are involved in how interlocutors experience an interaction and how they establish relational affinity, casting new light into the mechanisms that make a conversation worthwhile.

13

Optogenetic cochlear stimulation evokes midbrain activity with near-physiological temporal fidelity

Koert, E.; Götz, J.; Albrecht, N.; Vavakou, A.; Wolf, B. J.; Moser, T.

2026-05-19 neuroscience 10.64898/2026.05.16.724905 medRxiv

Top 0.3%

0.4%

Show abstract

When hearing fails, stimulation of the auditory nerve by electrical cochlear implants (eCIs) partially restores hearing, with most eCI users achieving open speech understanding. However, the broad current spread from each electrode limits frequency coding and speech understanding in daily situations with background noise. Spatially confined optogenetic stimulation by future optical cochlear implants (oCIs) improves frequency coding but millisecond closing kinetics of channelrhodopsins (ChRs) might limit temporal coding. Here, we evaluated the utility of fast-closing ChR f-Chrimson for processing temporal information in the auditory system of Mongolian gerbils. We recorded neural activity in the inferior colliculus evoked by f-Chrimson-mediated optogenetic stimulation of the cochlea. F-Chrimson enabled energy-efficient stimulation of the auditory pathway at rates [≥]150 Hz, outperforming the slower ChR variants CatCh (blue) and ChReef (green). Energy thresholds for activation of the auditory pathway were in the low {micro}J range, between ChReef (sub-{micro}J) and CatCh. Dynamic range and frequency selectivity were comparable to previous observations with CatCh and outperformed electrical stimulation. In conclusion, employing fast-gating ChRs harnesses improved spectral coding without degrading temporal coding. The Paper ExplainedO_ST_ABSProblemC_ST_ABSElectrical cochlear implants (eCIs) partially restore speech comprehension in most of 1 million otherwise severely deaf people. However, most CI-users face challenges hearing in daily situations. Spectrally more selective stimulation of the auditory nerve by optical cochlear implants (oCIs) promises to overcome this limitation. However, the closing kinetics of channelrhodopsins (ChR) limit the temporal bandwidth of bionic sound coding. Improving the ChR properties and evaluating temporal coding remain major objectives for developing hearing restoration by oCI. ResultsHere, we evaluate the utility of waveguide-based oCI using the fast-closing ChR Chrimson (f-Chrimson) for encoding of temporal, spectral and intensity information by multi-electrode-array (MEA) recordings from the midbrain. We compare f-Chrimson-mediated bionic coding to acoustic coding as well as to previous data acquired with optogenetic stimulation using other ChRs and with electrical stimulation. F-Chrimson enabled energy-efficient stimulation of the auditory pathway at rates [≥]150 Hz, outperforming the slower ChR variants CatCh (blue) and ChReef (green). Intensity and frequency coding were comparable to previous observations with CatCh and outperformed electrical stimulation. ImpactThis study demonstrates near physiological temporal coding with the fast-closing ChR f-Chrimson, indicating that improved spectral coding by oCI is not traded off by poor temporal fidelity.

14

Sound Advice: A calibration framework for defining detection space in Passive Acoustic Monitoring

Sharma, P.; Kezia, K.; Seshadri, K. S.

2026-05-22 ecology 10.64898/2026.05.20.726556 medRxiv

Top 0.3%

0.3%

Show abstract

Passive Acoustic Monitoring (PAM) has emerged as a transformative tool for biodiversity assessment in recent years. Despite widespread acceptance and application for conservation-related outcomes, the synergistic effects of hardware limitations, signal propagation, and environmental conditions on how far a signal can be reliably detected remain critically understudied. We quantified changes in signal detectability using Autonomous Recording Units (ARUs) in a tropical agroecosystem using playback experiments of standardised pure-tone (1-8 kHz) in fallow rice paddy fields. We deployed a four-ARU array and broadcast signals over a 50- 300 m distance gradient, and modelled operative detectability of signals using a binomial Generalised Linear Mixed-effects Model (GLMM). Our findings show that the detection space of an ARU is highly frequency-dependent and environmentally modulated. Detection probability for low-frequency signals (1 kHz) decreased rapidly (50% threshold at [~]100 m), whereas mid-range frequencies (4-6 kHz) occupied an acoustic window that remained reliably detectable up to 250 m. Higher relative humidity significantly enhanced overall detection, while increasing temperatures disproportionately reduced low-frequency detectability. The orientation of the ARU to the signal source was important as the detection probability declined from 81% for recorders facing the source (0{degrees}) to 14% for rear-facing units (180{degrees}). Our findings underscore the importance of determining the detection space before undertaking PAM. We propose a Decision Support Framework that provides a pathway for researchers to integrate focal taxa traits with technical constraints to determine detection space and optimise study designs when using PAM for monitoring biodiversity and assessing conservation action.

15

Unpredictable Motion Shapes Sensing Behaviors Across Timescales

Cadigan, S. C.; Smith, N. A.; Jones, T.; Wohlgemuth, M.

2026-05-21 animal behavior and cognition 10.64898/2026.05.18.726036 medRxiv

Top 0.3%

0.3%

Show abstract

Locating, tracking, and intercepting objects is a fundamental behavior for many organisms. For instance, predators must track and capture erratically moving prey for their survival. Using the echolocating bat as a model species, we investigate how short-term changes in target motion predictability affect longer-term motor plans when tracking a prey item. We used a paradigm where prey motion is under experimental control, and then applied computational methods to characterize how target motion predictability influences short- and long-term behavioral control. We find that target motion predictability during the tracking phase of insect capture influences both short-term changes in sonar call control, as well as longer-term behavioral control for transitioning between hunting phases. For changes in immediate behavioral control, bats produce more bursts of calls at a higher rate when tracking unpredictable moving prey, an indication that the bat is collecting more information about the targets motion for unpredictable than predictable trials. In terms of longer-term behavioral control, target motion unpredictability delays the transition from tracking to capture phase behaviors. We suggest that the bat does this to collect more information about target motion to time the transition from tracking to capture behaviors for hunting success. Additionally, we find the effects of target motion unpredictability are first seen as changes in the vocal motor plan and then the auditory motor plan (ear motion), hinting at a sequencing of motor changes that warrant further investigation. SummaryWhen presented with a more challenging hunting task, bats will increase their production of bursts of calls at a higher rate and delay their transition into capture behaviors.

16

Iconic Sound-Shape Correspondences in Aphasia

Dorsi, J.; Sandberg, C.; Lacey, S.; Nygaard, L.; Sathian, K.

2026-05-19 neuroscience 10.64898/2026.05.18.725976 medRxiv

Top 0.4%

0.2%

Show abstract

PurposeTo examine speech iconicity for shape in aphasia, we compared iconicity ratings from people with aphasia to those from neurologically intact individuals and evaluated how iconicity relates to phonological and semantic processing profiles in aphasia. MethodEleven people with aphasia and 11 age- and gender-matched neurologically intact participants rated how rounded or pointed 50 auditory pseudowords sounded using a 5-point scale. Ratings from participants with aphasia were compared to predicted iconicity ratings derived from reference ratings from prior work and to ratings from neurologically intact participants. For each participant with aphasia, correlations between individual ratings and predicted ratings were related to measures of phonological and semantic processing. ResultsRatings from people with aphasia were significantly correlated with both the predicted ratings and the ratings from neurologically intact participants. The strength of the correlation between individual ratings and predicted ratings did not differ significantly between groups, although there was a trend toward weaker correlations in the aphasia group. There were indications that greater language impairment was associated with greater disruption of iconicity ratings; in particular, deficits in phonological segmentation and semantic processing were associated with reduced sensitivity to shape iconicity. ConclusionThese findings suggest that sensitivity to shape iconicity is preserved in individuals with aphasia to varying degrees. The specific nature of language impairment appears to play an important role in determining iconicity processing in aphasia.

17

Daytime heat exposure increases nighttime predation risk in a mangrove gastropod

Jawad, W. A.; Collin, R.; Dwane, C.; Kelly, M. W.

2026-05-13 ecology 10.64898/2026.05.10.723115 medRxiv

Top 0.4%

0.2%

Show abstract

O_LIThe frequency and intensity of heat events is increasing across marine and terrestrial ecosystems. Within the same ecological community, the relative exposure and sensitivity to heat stress may vary considerably among interacting species, like predators and prey. This can be especially true for species that interact at the aquatic-terrestrial interface, as well as for interactions between primarily nocturnal and diurnal species, making it difficult to predict how such communities will respond to habitat warming. C_LIO_LIThermal limit metrics such as CTmax are often assumed to equate with ecological death because such temperatures impair behavioral activity and/or physiological functioning. Prey that are diurnally active can be more frequently exposed to temperatures that approach CTmax compared to their nocturnal predators, which may use thermal refuges during the day. Yet the impacts of daytime heat exposure on nighttime predation risk remain unknown. C_LIO_LIHere, we compared the thermal environment, performance, and heat tolerance between the predatory blue crab, Callinectus sapidus and one of its prey species, the mangrove periwinkle Littoraria anguilifera in a tropical mangrove ecosystem. We examined how exposing prey to heat stress at and below their CTmax affected their capacity to avoid predation in the field at night when predation risk is highest. C_LIO_LIWe found that acute exposure to temperatures near CTmax during the day increased the prey species susceptibility to predation during recovery at night. Although both interacting predator and prey have high thermal tolerance, prey are exposed to conditions that already reach CTmax, suggesting that current extremes in temperatures may already be influencing vulnerability to predation in this ecosystem. C_LIO_LIOur results suggest that differential exposure to sublethal heat stress in diurnal prey relative to their predator, along with the subsequent impact of these exposures on predation risk, will play a role in shaping these interacting as climate warms. C_LI

18

Crouching tiger, hidden dangers: Avian fatality rates reduced by red-blade patterning at a species-rich African wind farm

Simmons, R. E.; Martins, M.; Peralta, F. C.

2026-05-07 ecology 10.64898/2026.05.04.722424 medRxiv

Top 0.4%

0.1%

Show abstract

Avian collision rates are certain to rise as renewable energy industries roll out wind and solar farms to reduce fossil fuel impacts in biologically diverse areas of the world. Technological solutions are often sought to decrease mortality rates, but for developing nations automated shut downs are expensive, and alternatives required. A promising route is to increase blade visibility to birds using high contrast colours. Despite the success of the solid black-blade experiment in Norway only one other black-blade field-study in the Netherlands has explored this possibility, with no significant results. We tested the use of colour-patterned blades at a species-rich, 37-turbine, wind facility in Hopefield, South Africa. Two broad "signal red" stripes were applied to a single blade at four high-fatality turbines, in 2023 by Umoya Energy. Avian fatality rates were compared before and after painting using the Before-After-Control-Impact (BACI) approach. Seventy-five fatalities of 23 species of raptors, passerines and wetland species over 24 months were compared for the same 20 turbines after patterning with two sets of controls: (i) their four nearest neighbours (NN) and (ii) all 16 controls (AC). Over 32 months 25 fatalities were recorded, 23 occurred at the controls and only two at the patterned turbines. Testing with Bayesian Generalized Linear Models (BGLMs) revealed a median 83% reduction in fatalities at the patterned blades for both the NN turbines (credible intervals 14% - 98%) and the AC comparisons (30% - 97%). Bayes Factors (BF) revealed strong statistical support for NN (BF = 49.9) and AC comparisons (BF = 159). There was little evidence that birds avoiding patterned turbines increased fatalities at the neighbouring turbines as there was a small median 15% increase in fatality rates when NN controls were compared with other controls, and weak statistical support (BF = 0.15). Among 14 raptor species recorded on site, 10 species have suffered fatalities. Of seven individuals killed prior to treatment at the four patterned blades, only one was killed post-treatment suggesting blade patterning is equally effective at reducing raptor fatalities. Our results show that patterned blades had a high probability (83%) of reducing fatalities with strong statistical support despite the small samples. This supports the Norway experiment in a high diversity African setting, but with red patterns not a solid black design. The strong effect of red stripes may arise from both the high contrast it provides and the possible warning effect that red may elicit. We call for additional experiments to differentiate the effect of patterns and colours for the optimal design to reduce avian-turbine collisions.

19

Optimisation of OptoDrum protocol for measuring optomotor response in juvenile & adult zebrafish

Super, R.; Bui, B. V.; Xie, J.; Bou-Antoun, P.; Scholz, L.; Jusuf, P. R.

2026-05-21 neuroscience 10.64898/2026.05.20.720959 medRxiv

Top 0.5%

0.1%

Show abstract

Zebrafish (Danio rerio) are an important vertebrate model for vision and neuroscience research. In the larval stages, the aquatic species begins to elicit the optomotor response (OMR) to stabilize themselves in water -- a behaviour that may be exploited in the laboratory to measure visual acuity. However, up to now, the measurement of the OMR in juvenile and adult zebrafish has been limited due to their behavioural complexity. Here, we optimize a protocol to assay zebrafish aged between 4 and 9 weeks-post-fertilization, by displaying sinusoidal gratings parallel to the zebrafish eye to elicit a robust OMR. We assessed the visual spatial-frequency tuning function of an environmentally induced myopia model to confirm the sensitivity and robustness of the protocol. Additionally, we show the OMR is sensitive to the contrast and temporal resolution of the sinusoidal gratings. Furthermore, we found that the time between stimulus presentations impact the spatial-frequency tuning function likely as time is required for zebrafish to return to baseline swimming after eliciting the OMR. Finally, we found that the OMR after ten versus twenty seconds of stimulus onset appears comparable; indicating that robust OMR responses in zebrafish can be elicited through relatively short stimulus presentations. Through the experiments conducted, we present an optimized protocol specific to zebrafish. The protocol may be used to follow the progression or treatment efficacy of progressive neurological disorders including specific visual disorders and higher brain functions with visual endophenotypes. Ultimately, this protocol allows for high-throughput robust measures of visual and neural function in zebrafish.

20

Human and pet multimodal cues intensify wildlife fear responses

Hirobe, K.; Senzaki, M.

2026-05-16 ecology 10.64898/2026.05.13.725053 medRxiv

Top 0.6%

0.1%

Show abstract

O_LIFear of humans can drive persistent changes in wildlife behavioural and life-history traits, with cascading effects on entire ecosystems. Human multimodal cues and pet cues may influence impact of such fear, yet no study has tested how wildlife fear responses change when human acoustic cues and pet visual and acoustic cues are added to human visual cues. Filling this gap is important for managing human and pet outdoor activities while conserving wildlife. C_LIO_LIHere, with dogs representing the pet, we tested the effects of human and dog cues on fear responses of wild sika deer (Cervus nippon yesoensis) in approximately 800 km2 area, northern Japan, using alert distance (AD) and flight initiation distance (FID). First, we measured AD/FID with an approaching surveyor alone and with additional cues. Then, we fitted linear mixed-effects models while controlling for key covariates. C_LIO_LIFrom analyses with 266 observations, AD was estimated at 80.0 m with the human visual cue alone, and dog barking increased AD by 18.4m. FID was estimated at 57.1 m with the human visual cue alone, and human voice and the dog decoy increased FID by 11.3m and by 8.5 m, respectively. C_LIO_LIThese results demonstrate that human multimodal cues and pet cues can increase prey fear responses. Our findings also suggest that dog walking may expose wildlife to simultaneous human and pet cues more consistently than predator co-occurrence typically does in nature. The increase in FID with human acoustic cues, in contrast to previous studies, suggests that animals may shift cue weighting depending on predator species, potentially using human voices to help identify the threat as human. C_LIO_LIPrevious studies show that multimodal predator cues increase prey fear responses, and our findings extend this flamework to fear responses towards humans. Our findings can inform more tolerant management of human recreation and pet walking in sensitive areas. Reducing human and pet cues through signage, guidance, and zoning may prevent flight and associated energy expenditure, whereas mitigating vigilance may require behavioural guidance and spacing between pet-walking visitors. Overall, shaping how humans and pets behave may be more practical than blanket restriction. C_LI